Increasing Consensus Accuracy in DNA Fragment Assemblies by Incorporating Fluorescent Trace Representations
نویسندگان
چکیده
We present a new method for determining the consensus sequence in DNA fragment assemblies. The new method, Trace-Evidence, directly incorporates aligned ABI trace information into consensus calculations via our previously described representation, Trace-Data Classifications. The new method extracts and sums evidence indicated by the representation to determine consensus calls. Using the Trace-Evidence method results in automatically produced consensus sequences that are more accurate and less ambiguous than those produced with standard majority-voting methods. Additionally, these improvements are achieved with less coverage than required by the standard methods-using Trace-Evidence and a coverage of only three, error rates are as low as those with a coverage of over ten sequences.
منابع مشابه
Neural network input representations that produce accurate consensus sequences from DNA fragment assemblies
MOTIVATION Given inputs extracted from an aligned column of DNA bases and the underlying Perkin Elmer Applied Biosystems (ABI) fluorescent traces, our goal is to train a neural network to determine correctly the consensus base for the column. Choosing an appropriate network input representation is critical to success in this task. We empirically compare five representations; one uses only base ...
متن کاملImproving the Quality of Automatic DNA Sequence Assembly Using Fluorescent Trace-Data Classifications
Virtually all large-scale sequencing projects use automatic sequence-assembly programs to aid in the determination of DNA sequences. The computer-generated assemblies required substantial hand-editing to transform them into submissions for GenBank. As the size of sequencing projects increases, it becomes essential to improve the quality of the automated assemblies so that this time consuming ha...
متن کاملThe Value of ARMS/PCR and RFLP/PCR In Prenatal Diagnostic Accuracy Of -Thalassemia
Background: It is estimated that about 3,000 pregnancies in Iran are at risk for b-thalassemia each year. Objective: To evaluate the diagnostic accuracy of combination of ARMS/PCR and RFLP/PCR techniques in prenatal diagnosis of b-thalassemia.Methods: Sixty-seven b-thalassemia carrier families were enrolled in this study. To analyze b-globin gene, amplification refractory mutation system (ARMS...
متن کاملStrategies and Clinical Applications of Next Generation Sequencing
Abstract DNA sequencing is one of the great valuable techniques in molecular biology, which can be used to detect the sequence of nucleotides in a DNA fragment. The high-throughput sequencing known as Next Generation Sequencing (NGS) revolutionized genomic research and molecular biology; therefore, the whole human genome can be sequenced with a low cost in several days. NGS technology is simi...
متن کاملStrategies and Clinical Applications of Next Generation Sequencing
Abstract DNA sequencing is one of the great valuable techniques in molecular biology, which can be used to detect the sequence of nucleotides in a DNA fragment. The high-throughput sequencing known as Next Generation Sequencing (NGS) revolutionized genomic research and molecular biology; therefore, the whole human genome can be sequenced with a low cost in several days. NGS technology is simi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Proceedings. International Conference on Intelligent Systems for Molecular Biology
دوره 5 شماره
صفحات -
تاریخ انتشار 1997